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Abstract For given a pair of nodes in a graph, the minimum non-separating 
path problem looks for a minimum weight path between the two nodes such 
that the remaining graph after removing the path is still connected. The bal- 
anced connected bipartition (BCP2) problem looks for a way to bipartition a 
graph into two connected subgraphs with their weights as equal as possible. 
In this paper we present an 0{N log N) time algorithm for finding a mini- 
mum weight non-separating path between two given nodes in a grid graph of 
N nodes with positive weight. This result leads to a 5/4- approximation algo- 
rithm for the BCP2 problem on grid graphs, which is the currently best ratio 
achieved in polynomial time. We also developed an exact algorithm for the 
BCP2 problem on grid graphs. Based on the exact algorithm and a rounding 
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technique, we show an approximation scheme, which is a fully polynomial time 
approximation scheme for fixed number of rows. 

Keywords algorithm • approximation algorithm • non-separating path • 
balanced connected partition • grid graphs. 

1 Introduction 

Let G = {y, E, w) be a connected undirected graph, in which w is a nonneg- 
ative node weight. For two given nodes s and t, if we want to allocate some 
of the nodes for the communication between s and t, choosing a minimum st- 
path (a minimum weight path with endpoints s and t) may be the best way. 
However, if the chosen nodes cannot be used for other services, the remain- 
ing network may be separated into species. To keep the remaining network 
connected, one may hope to find a minimum non-separating st-path, i.e., a 
st-path P such that the remaining graph G — P is connected. However, in 
general, non-separating path does not always exist. A natural relaxation al- 
lows any connected subgraph containing both s and t. That is, we look for 
a minimum weight connected subgraph B containing both s and t such that 
G — B is connected. We name such a subgraph by non-separating st-connector 
(st-NSC or simply "NSC" if no confusion) and the one with minimum weight 
is the minimum non- separating connector (min-NSC). 

A node bipartition {U, V — U) is a. connected bipartition if both the sub- 
graphs induced by U and V — U are connected. Immediately if B is an NSC, 
{V{B), V—V{B)) is a connected bipartition. The maximum balance connected 
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bipartition (BCP2) problem looks for a connected bipartition ([/, V — U) such 
that the balance, defined by min{w{U),w{V — U)}, is maximized, in which 
w{U) denotes the total weight of nodes in U. The applications of BCP may 
appe ar in image processing , data bases, operating systems, cluster analysis, 



etc. ( Chataigner et al 



120071 ) ■ An m X n grid graph M is an undirected graph 
and can be thought of as a 2-dimensional matrix, in which m and n are the 
numbers of rows and columns, respectively. The node set of M can be repre- 
sented hy V — {Mij\l < i < m,l < j < n} and there exists an edge between 
two consecutive nodes in the same row or the same column. In this paper we 
study the min-NSC and the BCP2 problems on node-weighted grid graphs. 

The non-separating path problem has been studied from the perspective of 



tivity 


(Bol 


obas and A. Thomason 


1996; 


Chen et al. 


2003 


Kawarabavashi et al. 


2005, 


2008 


Tutte 


1963 


), but we haven't found any optimization problem about 



it. The min-NSC problem on general graphs is NP-hard in the strong sense 
and cannot be app roximated with rati o for any e > in polynomial 



time unless P=NP ( Wu and Chen 



20091 named "minimum border problem" 



In this paper we show that a minimum st-NSC on a grid graph is a minimum 
non-separating st-path and can be found in O(A'^logA^) time, in which N is 
the number of nodes. The efHcient algorithm is based on two key points. First, 
the min-NSC on a grid graph is a minimum weight path with at most one 
boundary subpath; and secondly, such a path can be found by reducing to a 
range minimum query (RMQ) problem. 
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The second result of this paper is about the BCP2 problem on grid graphs 
(GBCP2 for short). Based on NSC and si- numbering, we propose a 5/4- 
approximation algorithm with time complexity 0{N log N) for the GBCP2 
problemn, which is the currently best result achieved in polynomial time. We 
also developed an exact algorithm for GBCP2- For an to x n grid graph of 
total weight W, m < n, the algorithm takes 0{mNW8"'') time, which is more 
efficient than the naive brute force method of 0{N2^) time. The exact algo- 
rithm uses a typical dynamic programming strategy and computes the best 
bipartition for any possible weight and any connection topology of the first i 
columns for i from 1 to n. The analysis itself is of its own interest. An obvious 
upper bound of the number of connection topologies is m™. With a more pre- 
cise analysis and using the method of generating function, we show a sharper 
bound of 0(2™). Based on the exact algorithm and a rounding technique, we 
developed an approximation scheme. For any e > 0, the GBCP2 problem can 
be (l-l-e)-approximated in 0((1 -I- i)TO7V^8'") time, which is a. fully polynomial 
time approximation scheme (FPTAS) for fixed to. 



The BCPq problem is a generalization of BCP2, for which the input graph 
is partitioned into q connected subgraphs for any given q > 2. Previous re- 
sults about BCP2 are as follows. The BCP2 on grid graphs of more than 



two rows was shown to be NP-hard (Becker et al 



19981 ) while for grid graphs 



of two rows, a.k.a . ladders, the problem can be solved in polynomial time 



(Becker et al. 



20011 ). App roximation algorithm s for BCPg on grid graphs were 



also presented by Becker 



Becker et al 



(|l998r ) but the general approximation 
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ratios were not given, except for the case q = 2, for which a 3/2-approxiniation 
can be guaranteed. Besides the la dders, it is known that the BCPg problem 



is polynomially sol vable for tree s ( Perl and Schach 



connected graphs (jLovasz 



mm- 



Chlebikova 



198ll) and unweighted q- 



19961) showed that BCP2 on 



general graphs is NP-hard in the strong sense and cannot be approximated 
with an absolute error guarantee of for any e > unless NP=P. A 4/3- 

approximation algorithm was also given in that paper, which is currently the 
best approximation ratio of the problem, even on grid graphs. For BCP3 and 
BCP4, on 3- and 4-con nected graphs respective ly, there are 2-approximation 



algorithms proposed by 



Chataigner et al. 



torn . 



The rest of the paper is organized as follows: In Section 2, we give some no- 
tations and show that the min-NSC is a minimum non-separating path in a grid 
graph. In Section 3, we show the algorithm for the minimum non-separating 
path. The 5/4-approximation algorithm for GBCP2 is given in Section 4. The 
exact algorithm and the approximation scheme of GBCP2 are in Sections 5 
and 6, respectively. Finally, some concluding remarks are given in Section 7. 



2 Preliminaries 

Let G — {V, E) be a graph, S d V and H an induced subgraph of G. The 
subgraph induced by S is denoted by G[S']. By G — we denote G\V — 5], 
and similarly G-H = G[V - ViH)], in which V{H) is the node set of H. Let 
w : V Z+ be a node weight. By w{S), we denote the total weight of S, i.e., 
'"^('5') — X]ues'^(^)' ^'^^ convenience w{H) = w{V{H)). An m x n grid graph 
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Fig. 1 The minimum NSC and the minimum non-separating (induced) st-path. The nodes 
are labeled by their weights. The minimum st-NSC consists of the four nodes of weight 1 (in 
addition to s and t); the minimum non-separating st-path passes through the three nodes of 
weight 2; and the minimum non-separating induced st-path passes through the two nodes 
of weight 5. 

M will be thought of as an m x n matrix such that V{M) — {Mij\\ < i < 
m,l < j < n} and there exists an edge between two consecutive nodes in the 
same row or the same column. The set of nodes in the first or the last row and 
the first or the last column is called as the boundary of the grid graph. The 
four nodes Afi.i, Afi,„, M,n,i and Mmn are called as corner nodes. W.l.o.g. we 
assume n > m. Let N = mn — \V{M)\ and Wij the weight of node M^-. 

Let Ci, C2 and C3 be the sets of all non-separating induced st-paths, non- 
separating si-paths and st-NSCs, respectively. By definition, Ci C C2 C C3. 
For general graphs, they are different and there may be even no any non- 
separating st-path. Figure [1] illustrates a case that the minimum NSC and the 
non-separating (induced) si-path are different. In the remaining of this section 
we show that Ci — C2 — C3 on a grid graph except that {s, i\ is a 2-cut. 
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A node subset is a cut if the graph becomes disconnected after its removaL 
A 2-cut is a cut consisting of two nodes. For a grid graph of at least three rows, 
the only 2-cuts are the pairs of the two neighbors of corner nodes. Suppose 
that {s, t} is a 2-cut of an m x n grid graph G, in which n > m > 3. Let x be 
the corner node adjacent to both s and t. Apparently the minimum sf-NSC 
is either the path {s,x,t) or G — x, depending on which weight is smaller. 
The minimum non-separating path is similar but a little tricky. It is not hard 
to observe that the minimum non-separating si-path is either [s,x,t) or a 
Hamiltonian si-path of G — a: if the Hamiltonian path exists. If both m and 
n are odd integers, we can show that there does not exist a Hamiltonian si- 
path in G — x as follows. First we color s white, and then all other nodes are 
colored according to the rule: "the neighbors of a white node should be color 
black and vice versa" . It can be easily checked that t is colored white and that 
the numbers of white and black nodes are the same. Since a path with both 
endpoints colored white cannot have equal number of white and black nodes, 
no Hamiltonian si-path exists on G — a;. On the contrary, if m or n is even, we 
can show that a Hamiltonian si-path always exists. W.l.o.g we assume that x 
is the corner node Afo,o- The Hamiltonian paths for even number of rows, and 
columns respectively, are illustrated in Fig. [51 

Anyway, if {s,i} is a 2-cut, both the min-NSC and the minimum non- 
separating path can be easily computed, and we shall assume it is not the 
case in the remaining of the paper. For an NSC _B or a connected partition 
{V{B), V — V {B)) , a node v is movable if it is still a connected bipartition after 



Fig. 2 The Hamiltonian st-paths in G — x for even number of rows and even number of 
columns are shown in the left and the right, respectively 



moving v to the other part. A trivial observation is that v G V{B) is movable iff 

V is not an articulation node in B and v has a neighbor in — V{B), assuming 

V — V{B) is not empty. For a connected bipartition of a biconnected graph, 
there are at least tw o movable nodes in each part unless the part contains 



less than two nodes ( Chlebi'kova 



19961) . The following result comes from the 



minimality of the NSC. 

Lemma 1 If B is a minimum st-NSC of a biconnected graph, both s and t 
are movable and there is no other movable node in B. 

Theorem 1 // {s, is not a 2-cut, a minimum st-NSC B on a grid graph 
M of at least three rows is a non- separating induced st-path. 

Proof By definition M — B is connected, and it is sufficient to show that B 
is an induced st-path. A block is either a maximally biconnected subgraph or 
a bridge (an edge whose removal disconnects the graph). Let /C and A be the 
sets of the blocks and the cut ver tices of i?, respectively. The block- cutpoint 



tree T of B is defined as follows ( West 



2001 



Chap. 4). V{T) = /C U A, and 



9 



for any a ^ A and K E IC, {a, K) E E{T) iff, in the original graph B, a is a 
vertex in block K. By definition T is a tree and each leaf of T corresponds to 
a block of B. 

Since M is biconnected, for any leaf K of T, there is a node in the block 
K oi B which is adjacent to M — B and therefore movable. By Lemma [TJ s 
and t are the only movable vertices in B. So T has at most two leaves, and is 
therefore a path. We shall show that each block is an edge, and the proof is 
completed. Suppose by contradiction that K is a block of B and |y(i^)| > 2. 
Since M is a grid graph of at least three rows, there are at least three nodes 
in K adjacent to M — if unless K contains all nodes but a corner (in this case 
K = B). li K contained all nodes but a corner node, s and t would be the two 
neighbors of the corner node and therefore a 2-cut, violating the assumption. 
A node in K and adjacent to AI — K is either 

— a cut of B and therefore adjacent to a component oi B — K; or 

— adjacent to Af — _B and therefore a movable node of B because no node of 
a block of more than two nodes is a cut of the block. 

Since M is biconnected, there is a movable node of B in each component of 
B — K. Therefore there are at least three movable nodes in B, and by Lemmafl] 
B is not a minimum si-NSC. □ 

Corollary 1 For a grid graph of at least three rows, the minimum NSC, the 
minimum non- separating path and the minimum non- separating induced path 
are all equivalent except that s and t are the two neighbors of a corner node. 
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3 Minimum non-separating path 

In this section we show how to find a minimum non-separating path efficiently. 
A boundary path is a path along the boundary, i.e., all the nodes are on 
the boundary. A interior path is a path whose internal nodes are not on the 
boundary. In the following we shall denote by B the boundary of M. For a 
path P, a boundary subpath of P is a maximal subpath of P with all nodes 
on the boundary, i.e. a boundary subpath of P is not a subpath of another 
boundary subpath of P. A subpath may contain only a single node. 

Lemma 2 A non- separating induced st-path has at most one boundary sub- 
path. 

Proof If P is an induced path and has more than one boundary subpaths, there 
are at least two boundary segments divided by P, and each of these segments 
is in one component of M — P, which implies P is not non-separating. □ 

Lemma 3 If P is a minimum st-path with at most one boundary subpath, 
then P is a non- separating induced path. 

Proof Let P = (s = vi,V2, . ■ ■ ,vi = t). First we show that P is an induced 
path by contradiction. If not, there exist Vi and Vj such that (vi,Vj) G E and 
i < j — 2. Since removing a subpath from a path will not increase the number 
of boundary subpaths, we can replace the subpath from Vi to vj with the edge 
{vi, Vj) to obtain a path of less weight and with at most one boundary subpath, 
a contradiction to the minimality of P. 
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For any induced path, the subgraph induced by any of its node subset is 
not a cycle. Therefore P cannot include the whole boundary. Since P has at 
most one boundary subpath, the remaining boundary nodes B — V{P) are 
connected and not empty. If P was not non-separating, there would be an 
interior node separated from the remaining boundary by P. Since there is no 
induced cycle in V{P) and P only has at most one boundary subpath, it only 
happens at the case that the remaining boundary contains only a corner node, 
which implies that P is a boundary path including the whole boundary but 
the corner node, i.e., s and t are the two neighbors of the corner node. But 
this contradicts the assumption that {s, i) is not a 2-cut. □ 

By the above results, a minimum NSC is a minimum non-separating path, 
and is also a minimum st-path with at most one boundary subpath. This prop- 
erty is helpful for designing our algorithm. The next corollary is immediate. 

Corollary 2 If both s and t are on the boundary, a minimum non- separating 
st-path is a boundary path and can be found in time linear to \B\. 

For two nodes u and v, let dB{u,v), and di{u,v), denote the minimum 
weight of any boundary path, and interior path respectively, between u and 
V. For dBiu,v), both u and v must be on the boundary. Let d{u,v) denote 
the minimum weight of any non-separating induced uv-pa,th. We can have the 
next lemma. 

Lemma 4 If s is on the boundary and t is not, d{s,t) = minjge{dB(s, i) -|- 
di{i,t) - w{i)}. 
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Proof By Lemma [3J it is sufficient to compute the minimum weight of any 
st-path witli at most one boundary subpath. Since s is on the boundary, the 
optimal path must be a concatenation of a boundary subpath and an interior 
subpath, including the degenerating case that the boundary subpath is only 
one node, i.e., s in this case. □ 

For each u of the four corner nodes, the value di{v,u) = oo for any v. It 
is trivial that the total time complexity to compute (is(s, i) for every i G B is 
linear to \B\. To compute d{s,t), it is sufficient to find di{t,i) for every i ^B, 
and then the minimum can be found in 0(|i3|) time. Since any boundary node i 
other than a corner node has only one interior neighbor, di{t^ i) is the minimum 
weight of any path between t and the interior neighbor of i on AI — B. Since 
a minimum weight path can be found by an algorithm similar to Dijkstra's 
algorithm and the number of edges in a grid graph is linear to the number 
of nodes, the time co mplexity is dominated by t he one of Dijkstra algorithm, 



which is 0{N log N) (jFredman and Tarian 



19871) 



Corollary 3 If s is on the boundary and t is not, d(s, t) can be found in 
0{N log N) time. 

The proof of the next result is similar to Lemma |4] and is omitted. 

Lemma 5 // neither s nor t is on the boundary, d{s,t) is the minimum be- 
tween di{s, t) and 



min{d/(s,i) + dsii, j) + di{j,t) - w{{i,j})} (1) 
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Note that if the optimal path contains a boundary node, it must contain 
at least two boundary nodes since a boundary node has at most one interior 
neighbor. The time for computing c?/(s, t) is 0{N log N) by Dijkstra algorithm. 
To compute ([1]), a naive algorithm of checking all possible i and j takes quadric 
time. We shall give an 0(|;B|) time algorithm in the following. 

Starting at an arbitrary boundary node, we number the boundary nodes 
clockwise from 1 to \B\. Let w{i) be the weight oi i £ B and Wb denote the 
total weight of boundary nodes. Our algorithm has two rounds. In the first 
round, we find for every i the best j such that the minimum boundary path 
from i to j is clockwise. The other case that the path is counterclockwise is 
checked in the second round. Since the two rounds are similar, we shall only 
show the first round. For convenience, we double the whole sequence, i.e., the 
+ i)-th node is the same as the i-th node for 1 < i < \B\. The minimum 
boundary jj-path is clockwise if 



Therefore we define right(i) as the maximum index j in + \B\ — 1] such 
that the path is clockwise. 

Since right(i) is a increasing function, it is not hard to show that com- 
puting right(i) for every i can be done in totally 0(|;B|) time. For every i 
from 1 to \B\, we find j* G (i,right(i)] minimizing di{s,i) +J2i<k<j-' + 
di(t,j*). For a fixed i, equivalently we only need to find j* minimizing <j{j*) — 
Ei<fc<j' ^e*^) ~^di{t,j*). By this way we reduce our problem to a range mini- 
mum query (RMQ) problem. For a one dimensional array, there is an algorithm 




i<k<j 
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which reports the minimum in any ind ex range in constant time after a hnear 



time preprocessing (j Bender et al 



20051 ) ■ For our problem, the array a has 2\B\ 



elements and we need to perform |i3| queries. Therefore the time complexity 
for finding indices i and j minimizing ([1} is 0(|;B|). Clearly cr(j) for every j 
can be found in 0(7Vlog A^) time. Along with Corollaries [5] and HI we have the 
next result. 



Theorem 2 A minimum non-separating path on an N -nodes grid graph can 
be found in 0(N log N) time. 



4 A 5/4-approximation algorithm for GBCP2 

In this section we show an O(A^logA^) time 5/4-approximation algorithm for 
the GBCP2 problem by using the minimum non-separating path. Let W be 
the total weight of all nodes. For the GBCP2 problem, the optimal solution 
is trivial if there exists a node of weight at least W/2. Therefore we shall 
exclude this case in the following. We shall first introduce an algorithm for 
finding a bipartition of any biconnected graph but not necessarily of a grid 
graph. This algorithm uses the st-numbering and finds a 4/3-approximation 
of BCP2. Then we show how to improve the ratio to 5/4 for grid graphs by 
using the minimum non-separating path. 
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4.1 An algorithm based on st- numbering 

For a biconnected undirected graph G = (F, E) and s,t £V ^ an si-numbering 
is a 1-to-l labehng \ :V {l..n}, n = \V\, satisfying A(s) — 1 and A(i) — n; 
and, for each node v G V — {s,t}, v has a neighbor with label smaller than 
X{v) and also a neighbor with label larger than X(v). An st-numberin g of a 



19761). The 



biconnected graph can be found in linear time (jEven and TarianI 
original algorithm for st-numbering requires that s and t must be adjacent. 
But, if (s,i) ^ E, we can simply add edge (s,i) and run the algorithm to 
obtain an st-numbering for any s and i in a biconnected graph. 

For an st-numbering, let Vk ~ {v\X{v) < k}. The next property can be 
shown by the definition of st-numbering. 

Claim For any 1 < k < n, the bipartition (Vk, V — Vk) is a connected biparti- 
tion. 

Proof For any node v G Vk, since each node has a neighbor with smaller 
st-number, there exists a path from w to s in G'[Vfc]. Consequently G'[Vfc] is 
connected. It can be shown similarly that G[V — Vk] is also connected. 

□ 



Algorithm STN 

Input: A biconnected graph G ~ (V, E). 
Output: A connected bipartition G. 

1: let s and t be the nodes of the largest and the second largest weights; 
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2: compute an st- numbering A; 

relabel the nodes such that V — {vi\l < i < n} and X{vi) = i,Vi; 
3: find k* = argmaxj: min{w(Vfe), W — w{Vk)}; 
4: output (,Vk.,V -Vk'). 



Lemma 6 The algorithm STN takes linear time. IfVk" is the solution output 
by the algorithm, then min{i(;(V/c-> ), W — wiVk*)} > {W — W3)/2, where W3 is 
the third largest node weight in G. 



Proof Due to lEven and TarianI (jl976[ ). the si- numbering in Step 2 can be 



computed in linear time. Step 3 can be done as follows. First, in linear time, 
we find i such that wiyi) < W/2 and wiyi+i) > W/2. It is trivial that 
k* is either i or i + 1. By a simple computation, we have that /c* = i if 
w{Vi) >W — w{Vi+i); and k* = i + l otherwise. The total time complexity is 
linear. 

Furthermore, by the optimality of fc*, the weight difference between Vk* 
and V — Vk* is at most w{vi+i). That is, \w{Vk') — {W — w{Vk* j)\ < 
which implies mm{w(yk*),W — w{Vk')} > (W — u'(t;i-(-i))/2. Since no node 
has weight > W/2, 1 < i + 1 < n, i.e., Wi+i is neither s nor t. Therefore 
w{vi+i) < W3. □ 
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4.2 A 5/4-approxiniation algorithm 



By Lemma [S] and the analysis in (jChlebfkovalll996l ). it can be shown that the 



algorithm STN is a linear time 4/3-approximation algorithm for the BCP2 
problem on any biconnected graph. Of course it works for grid graphs since a 
grid graph is biconnected. The remaining paragraphs of this section aims at 
improving the approximation ratio to 5/4. In the remaining we assume G is an 
m X n grid graph and = m x n, in which n > m > 3. Let H = {hi\w{hi) > 
be the set of heavy nodes. Clearly | < 4. We shall show how to find 
a 5/4-approximation solution for each possible value of \H\. The minimum 
non-separating path will play an important role in the case of | = 3. 

Claim When \H\ < 2, the algorithm STN finds a 5/4-approximation. 

Proof In this case the third largest node weight W3 is at most T4^/5. By 
Lcmma[6l the returned bipartition satisfies mm{w{Vk* ),W — w{Vk* )} > {W — 
W/5)/2 = (2/5)M^. Comparing with the trivial upper bound W/2, the ratio is 
5/4. □ 

Claim When \H\ — 4, the algorithm STN finds a 5/4-approximation. 

Proof Since the nodes are arranged in a linear order by their si-numbers. There 
always exists a bipartition, said V, such that both the two parts contain exactly 
two heavy nodes and are of weight larger than (2/5)1^. Since the algorithm 
finds the best bipartition in this linear order, the result is at least as good as 
ofV. □ 

Finally we consider the case that \H\ = 3. 
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Definition 1 Let G be a graph and U a subset of nodes of G. The contracted 
graph G\f7 is the graph obtained by combining all the nodes in J7 by a new 
node u and, for any v ^U, the edge {u, v) exists iff v has a neighbor in U. For 
convenience, G\S = G\V{S) for a subgraph S. 

Lemma 7 If G is biconnected and {U, V{G) — U) is a connected bipartition, 
then G\U is biconnected. 

Proof Since {U, V{G) — (7) is a connected bipartition, the new node u is not a 
cut node in G\U. Since G is originally biconnected, no node in V{G) — U will 
be a cut node in the contracted graph. 

Our 5/4- approximation algorithm for the case of three heavy nodes is as 
follows. 

Algorithm Three_Heavy 

1: let H = {hi, /12, /13} be the set of nodes of weight > W/5; 

2: find a minimum hihj-NSC Bij for each 1 < i < j <3; 

3: let B be the NSC of smallest weight among those found in Step 2; 

4: if w{B) < W/2 then 

5: call algorithm STN on the contracted graph G\B; 
6: else output {V{B), V - V{B)). 

Claim In the case of \H\ = 3, the algorithm Three_Heavy finds a 5/4- 
approximation in 0{N log N) time. 
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Proof By Lenima[71 the contracted graph G\B is biconnected and has exactly 
two nodes of weight > W/5. If w{B) < W/2, the resuh is the same as the 
case of \H\ = 2. For otherwise we claim that {V{B), V ~ V{B)) is an optimal 
connected bipartition as follows. Let {U, V ~ U) he an optimal connected bi- 
partition. Since there are three heavy nodes, at least two heavy nodes must in 
one same part. W.l.o.g. we assume that heavy nodes hi and ^2 are in U. By 
the definitions of NSC and B, we have that w{U) > w{Bi2) > w{B), and this 
implies {V{B), V — V{B)) is an optimal connected bipartition. 

By the result of previous section, B must be an induced path or the whole 
grid graph lacking a corner node. And in either case the contracted graph G\B 
can be easily constructed in linear time. Since the algorithm STN takes also 
linear time, the total time complexity is dominated by the step of finding the 
minimum NSC's, which is 0{N log N) according to Theorem [51 

□ 

We conclude this section as follows. 

Theorem 3 The GBCP2 can he approximated with ratio 5/4 in 0{N log N) 
time. 

5 An exact algorithm for GBCP2 

In this section we develop an exact algorithm for GBCP2. We shall assume 
that the grid graph has at least three rows. For 1 < j < n and a bipartition 
(^1 Vi), we use a vector Zj G {0, 1}™ to represent a bipartition of the j-th 
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column such that zj = if e Vq and is one otherwise, in which Zj denotes 
the i-ih component of Zj for 1 < i < m. 

We shall represent by configuration {z,9,t) a possible bipartition (Vq, Vi) 
of the first j columns such that the partition of the j-th column is z, the weight 
of Vi is 6, and r represents how the nodes of column j are connected in the 
first j columns. That is, for the first j columns, if we delete the nodes of Vi , 
T represents the connected components of the nodes of Vq; and similarly the 
components of Vi if we delete the nodes of Vq. We say r is a connection topology. 
We shall develop a dynamic programming algorithm computing all possible 
configurations for j from 1 to n. We first discuss the connection topology. 

5.1 Connection topology 

To represent a connection topology, it is sufficient to use a data structure 
for disjoint sets. Each set contains the row indices of the nodes in the same 
connected component. Precisely speaking, at column j, we use an array r 
such that r[i] stores the representer of the set Mjj belongs to, in which the 
representer of a set is the smallest row index of its members. For example, 
if, at the first j columns, the nodes M2J, M^j and Mgj are in the same 
component, we record the set {2,3,9} by r[2] = t[3] = r[9] = 2. Since the 
nodes in a component are all in Vq or Vi, we divide a connection topology into 
a O's connection topology, or 0-topology for short, and a 1-topology. That is, 
a g-topology is a partition of {i|zj = q} for column j. A trivial upper bound 
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of the number of connection topologies is m™, and we aim at finding a much 
sharper bound. We shall first consider the 0-topology. 

First of all, if two adjacent nodes are in Vq, they must be in the same 
subset. We transform a column vector into a set of disjoint segments of Os (0- 
segments for short). Labeling the segments by consecutive integers from 1, we 
have an interval [l,p]. Clearly p < \m/2] . For the above example of {2, 3, 9}, 
2 and 3 are the first segment and 9 is the second segment. We shall represent 
the bound by a function of p. In the remaining a 0-topology shall be thought 
of as a partition of {l.-p}. 

Consider the points {{i, 0)|1 < i < p} in the Euclidean plane. The number 
of 0-topologies is the number of ways of adding some lines on one half-plane to 
joint some of these points. If two lines are cross, the corresponding segments 
are in the same component. 

Definition 2 For two subsets and ^2 in a 0-topology T, Si is covered by 
52 if max(S'2) > max(S'i) and min(S'2) < min(5i). A subset is covered if it is 
covered by some other subset and is uncovered otherwise. 

A crucial observation is as follows: 

If T is a 0-topology of and i € S € T, then S — {i} cannot be 

covered by any S' G T- 

Let a{i,i) denote the number of 0-topologies of i segments with j uncovered 
subsets iov 1 < j < i < p. In any 0-topology of i segments, i must be in one 
of the uncovered subsets of a 0-topology of i — 1 segments. For a 0-topology 
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of i — 1 segments with k uncovered subsets, joining i to the j-th uncovered 
subsets wiU form a 0-topology of i segments with j uncovered subsets, for 
j < k < i — 1. Besides, leaving i itself as an uncovered subset along with a 
j — I uncovered subsets also forms a 0-topology of j uncovered subsets, seeing 
Fig. 131(a). Therefore we can have the following recurrence relation. 



fc=i-i 

The boundary conditions are a{i, 0) = and a{i, i) = 1 for any i. 

Lemma 8 For a fixed 0-topology with j uncovered subsets, there are at most 



Proof Clearly the 0-segments interleave the 1-segments, and vice versa. Let 7" 
be a 0-topology. For S ^ T, we say that a 1-segment i is minimally covered by 
S" iff z is covered by S but not covered by any S" G 7" covered by S". As shown in 
Fig. [3)(b), for any S" G T, all the 1-segments minimally covered by S is in the 
same component. So, for the 1-segments covered by at least one subsets in T, 
their components are fixed. Therefore the number of 1-topologies only depends 
on the uncovered 1-segments. If there are j uncovered subsets in S, there are 
at most j + 1 uncovered 1-segments. Let f{k) denote the maximum number 
of 1-topologies with k uncovered 1-segments. Clearly /(I) = 1. For fc > 1, the 
k-th 1-scgmcnt cither forms a component by itself or joins to the (fc — l)-th 
1-segment. Note that since they are uncovered, it cannot join to other segment 
but skip from the (fc — l)-th one. Consequently we have /(fc) = 2/(fc — 1) for 
fc > 1, and it turns out to be /(fc) = 2'="i < 2^. □ 




(2) 



2-' 1-topologies. 
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Joint to one of the uncovered subsets 




(a) (1 -segments are not shown) 

Fig. 3 Connection topologies (the columns are transposed). 



010101010 1010 
(b) 



Corollary 4 For p 0-segments, the number of connection topologies is upper 
bounded by t{p) = J2i<j<p 2^a(P, j)- 

Proposition 1 For any h>0 and positive i, J2k=o ('7-1'') ^ (^f*)' 

Proof By induction on h. First when h = 0, J2k=Q ('7-1'') = (^1) = i'T)- 
Suppose that the inequahty holds for h-1, i.e., J2kZl ('T^t'') ^ C^'i"^)- We 
have 



^ ^i-l + k\ fi-l + k\ fi-l + h 



< 



fc=0 ^ ^ fc=0 

i + h-l\ fi + h-\\_/i + K 
i-l 



i - 1 



□ 



Lemma 9 a{i,j) < ( *j "*) for 1 < j < i. 

Proof By induction on i. When i = I, the only feasible value of j is 1, and 
a{l, 1) = 1 = (^YO' Suppose that it holds for i — 1 and any 1 < j < « — 1. By 

a, 



k=j-l 



k=]-l 



k=0 



i - 1 
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The last equality is obtained by substituting k hy i — I — k. Then, by Propo- 
sition [H a{ij)< (^V)- ° 

Lemma 10 t{p) < - i^^) forp>l. 

Proof By Corollary U and Lemma [5J 

i<j<p i<j<p VP/ j=o \ J / 

The last equality is obtained by substituting j with p — j. We derive a closed 

form of i{p) by the method of generating function. First, 2^'^^ (^^^) is the 

coefficient of in 2^-' (2 + x)p^^ , and is the coefficient of in 2^^x'p^^{2 + 

x)P+3 . Then f (p) is the the coefficient of a;*' in the following generating function: 

T{x) ^ E 2"^'a;^"^'(2 + x)P+^ (3) 



Since 



2x 

— r(a;) = E 2-^+'xP-^+\2 + xY+^~^ 



2 + x 

3=0 



p-2 

= E 2-^'a;P-^ (2 + a;)P+^' (4) 
subtracting (|4]) from ([3]) and solving T(a;), we obtain 

T{x) = (^4^) (2i-fa;(2 + xfP - 2x^+^2 + ) 

Since is the coefficient of x^ in T(a;), the second term is useless, and 

-g(T)^j(?)-c;)--c; 
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□ 

Since p is the number of 0-segments and bounded by \m/2'] , we obtain the 
following result. 

Theorem 4 The number of connection topologies is 0(2™). 
5.2 An exact algorithm for GBCP2 

Let Lj be the list of all configurations for column j. Initially, for each binary 
m- vector z, there is only one connection topology r and only one possible 
total weight 6 of Vi. So Li contains 2"* configurations. For j from 2 to n, 
our algorithm computes Lj from Lj-i in each iteration, as well as checks if a 
better feasible solution is obtained. We say a component is "closed" at column 
J — 1 if the component contains at least one node of column j — 1 but none 
of column j. For any (z, 9, r) e ij-i and any bipartition z' of column j, we 
check the following conditions: 

— lfVq,qe {0, 1}, has exactly one component in (z, 9, t): 

— all nodes of column > j assigned to Vi-g is a feasible solution. We need 
only check if it is the currently best solution but not insert it into Lj. 

— any other z' closing the component is illegal and should be discarded. 

— Otherwise any z' closing any component at column j — 1 is illegal. 

— If it is not illegal, compute 6' and r', and insert {z',6',t') into Lj. 

Finally for each configuration of column n, we check if it is feasible and update 
the best one if necessary. We next analyze the time complexity. 
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To store the configurations, we use a, 2"'- x W table T and each entry of 
the table is a list of connection topologies. For a configuration {z,9,t), we 
store r in the list of T[z,9]. By using a balanced binary search tree (such 
as AVL tree) with array t as the key, we can check the existence and insert 
a configuration in 0{m^) time. Note that we need to check and avoid the 
duplicates to ensure the number of configurations is bounded. The number of 
configurations is 0(2"' xWx 2™) ~ 0{W x 4™). Since there are n iterations 
and in each iteration it takes 0{m?') time for every configuration and every z', 
the total time complexity is 0{im?nW^™^) = 0{mNW%"^). It is much better 
than 0{N2^) of the brute force algorithm. 

Theorem 5 The GBCP2 can be solved in 0{mNW8'^), in which N is the 
number of nodes, m is the number of rows, and W is the total weight. 

6 Approximating GBCP2 

Based on the exact algorithm for GBCP2 in the previous section and a scaling 
technique, we shall show an approximation algorithm for GBCP2, which is a 
FPTAS for fixed number of rows. 

For some p < 1, we scale down the weights by a factor r — pW/{3N), i.e., 
we set w'j^j — lwij/r\ for each i and j. Then we run the exact algorithm on 
the modified weight w' to obtain a bipartition (Vb,Vi) as the approximation 
solution. Let (Vq*, V{) be the optimal bipartition w.r.t. w. The following result 
can be derived from Lemma [6l 
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Lemma 11 // no node has weight larger than W/2, mm{'w{VQ),w {¥{')} > 

min{w{V„'),w{V,')} 



Lemma 12 """^'"^''o ^■"'^'^i < ^ 
i.emma iz ^i^{^(^Vo),w{Vi)} - (i-p) ' 

Proof W.l.o.g. let w{Vo) < w(Vi). By definition, w{Vq) > ™'(Vo). Since 
(^i^i) is the optimal bipartition w.r.t. w' , minq w' (Vq) > muiqw' (V*) and 



therefore 



w{Vo) > rw'iVo) > rminw'(V?^ 



9 « 



By the definition of w' , for q = or 1, w{Vq) < rw'{Vq) + rN. Hence, 
rminqw'{V*) > min^ w{Vq) — rN, and we obtain 



wiVo) > uimw(V*) — rN 



Then, since r = pW/{iN) and m\i\qw{V*) > W/3 by Lemma [TTl the 
approximation ratio can be calculated by 

minqw{Vq*) W/3 _ 1 



q 



min, w{Vq*) ^rN - W/3 ~ pW/3 1 - p ' 



which completes the proof. □ 

Theorem 6 For any e > 0, a (1 + e)- approximation of the GBCP2 can be 
found in 0((1 + ^)mN'^8"^) time, in which N is the number of nodes and m 
is the number of rows. For fixed number of rows, it is a FPTAS. 

Proof For any given £ > 0, we set p = and r = pW/ {3N). By Lemma [T^ 
the approximation ratio is 1 + £. By Theorem [5l the time complexity is 
OimNW'8"') = 0{mN{ ^^Y{3N) )^"') = 0{{l + ^)mN^8"'). 

a 
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7 Concluding remarks 

The approximation algorithm shown in Section 6 is an FPTAS only for fixed 
number of rows. The interesting open problems include how to design an FP- 
TAS or PTAS for the GBCP2 of non-fixed number of rows and how to evenly 
partition a (grid) graph into more than two connected subgraphs. 
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